pvsR: An Open Source Interface to Big Data on the American Political Sphere
Authors
Abstract
Digital data from the political sphere is abundant, omnipresent, and increasingly accessible directly through the Internet. Project Vote Smart (PVS) is a prominent example of this big public data and covers various aspects of U.S. politics in astonishing detail. Despite the vast potential of PVS' data for political science, economics, and sociology, it is hardly used in empirical research. The systematic compilation of semi-structured data can be complicated and time-consuming, as the data format is not designed for conventional scientific research. This paper presents a new tool that makes the data easily accessible to a broad scientific community. We provide the software, called pvsR, as an add-on to the R programming environment for statistical computing. This open source interface (OSI) serves as a direct link between a statistical analysis and the large PVS database. The free and open code is expected to substantially reduce the cost of research with PVS' new big public data in a wide variety of possible applications. We discuss its advantages vis-à-vis traditional methods of data generation as well as already existing interfaces. The validity of the library is documented based on an illustration involving female representation in local politics. In addition, pvsR facilitates the replication of research with PVS data at low cost, including the pre-processing of data. Similar OSIs are recommended for other big public databases.
Similar resources
Possibility of the effect of the Internet on 'Public Sphere' in Jurgen Habermas's Thought?
In recent decades, "Public Sphere" is one of the most important concepts in political science. Jurgen Habermas the famous thinker in this approach is the first to use this concept in critical thinking, where he demonstrates how networking is used for communicative actions. Habermas has not included the Internet as an important part of his thought process in the "public sphere", however, I think...
A Fuzzy TOPSIS Approach for Big Data Analytics Platform Selection
Big data sizes are constantly increasing. Big data analytics is where advanced analytic techniques are applied on big data sets. Analytics based on large data samples reveals and leverages business change. The popularity of big data analytics platforms, which are often available as open-source, has not remained unnoticed by big companies. Google uses MapReduce for PageRank and inverted indexes....
Drug Discovery and Big Linked Data
A large part of the daily practice of a researcher doing in vitro Drug Discovery is comparing and manually matching high-quality information from multiple disciplines in the Life and Biomedical Sciences. The Open PHACTS Discovery Platform is an initiative to integrate publicly available data relevant for both academia and the pharmaceutical industry. It integrates numerous datasets including fo...
American Decline in the Perspective of Democrats and Republicans: Otherization and Construction of American Identity in Election Speeches
American decline has received substantial attention from Iranian political and academic circles, with few analysts paying attention to the domestic debate on the concept. One of the overlooked aspects is the different ways that liberals and conservatives define and construct it. This difference is manifested in the perception of American identity and on the global role of the US, and intensifie...
Social Media and Politics: Examining Indonesians’ Political Knowledge on Facebook
The Internet and social media have played a significant role in contemporary political sphere of Indonesia. In particular, they have been widely used for political activism and discussion; but whether the discussions are constructive is another issue. Constructive political discussion requires several preconditions; one of the most important requirements is rational reasoning. Citizens must be equip...